A New Large Urdu Database for Off-Line Handwriting Recognition
نویسندگان
چکیده
A new large Urdu handwriting database, which includes isolated digits, numeral strings with/without decimal points, five special symbols, 44 isolated characters, 57 Urdu words (mostly financial related), and Urdu dates in different patterns, was designed at Centre for Pattern Recognition and Machine Intelligence (CENPARMI). It is the first database for Urdu off-line handwriting recognition. It involves a large number of Urdu native speakers from different regions of the world. Moreover, the database has different formats – true color, gray level and binary. Experiments on Urdu digits recognition has been conducted with an accuracy of 98.61%. Methodologies in image pre-processing, gradient feature extraction and classification using SVM have been described, and a detailed error analysis is presented on the recognition results.
منابع مشابه
Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملA Novel Comprehensive Database for Arabic Off-Line Handwriting Recognition
This paper presents the work toward developing a new comprehensive database for Arabic off-line handwriting recognition. The database includes: isolated Indian digits, numerical strings, Arabic isolated letters, and a collection of 70 Arabic words. Also, the database includes a free format sample of an Arabic date. A data entry form was designed to collect written samples from Arabic native spe...
متن کاملFrom Off-line to On-line Handwriting Recognition
On-line handwriting includes more information on the time order of the writing signal and on the dynamics of the writing process than off-line handwriting. Therefore, on-line recognition systems achieve higher recognition rates. This can be concluded from results reported in the literature, and has been demonstrated empirically as part of this work. We propose a new approach for recovering the ...
متن کاملStrategies for Combining On-line and Off-line Information in an On-line Handwriting Recognition System
This paper investigates the cooperation of on-line and off-line handwriting word recognition systems. Our goal is to improve a mature on-line recognition system by exploiting the complementary information present in the off-line representation built from on-line signal. After describing the on-line and off-line HMM based handwriting recognition systems, we propose a formal framework, which allo...
متن کاملHandwriting Recognition of Whiteboard Notes - Studying the Influence of Training Set Size and Type
This paper presents a system for the recognition of on-line whiteboard notes. Notes written on a whiteboard is a new modality in handwriting recognition research that has received relatively little attention in the past. For the recognition we use an offline HMM-recognizer, which is supplemented with methods for processing the on-line data and generating off-line images. The system consists of ...
متن کامل